AITopics | data practitioner

Collaborating Authors

data practitioner

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Evolution of LLM Adoption in Industry Data Curation Practices

Qian, Crystal, Liu, Michael Xieyang, Reif, Emily, Simon, Grady, Hussein, Nada, Clement, Nathan, Wexler, James, Cai, Carrie J., Terry, Michael, Kahng, Minsuk

arXiv.org Artificial IntelligenceDec-20-2024

As large language models (LLMs) grow increasingly adept at processing unstructured text data, they offer new opportunities to enhance data curation workflows. This paper explores the evolution of LLM adoption among practitioners at a large technology company, evaluating the impact of LLMs in data curation tasks through participants' perceptions, integration strategies, and reported usage scenarios. Through a series of surveys, interviews, and user studies, we provide a timely snapshot of how organizations are navigating a pivotal moment in LLM evolution. In Q2 2023, we conducted a survey to assess LLM adoption in industry for development tasks (N=84), and facilitated expert interviews to assess evolving data needs (N=10) in Q3 2023. In Q2 2024, we explored practitioners' current and anticipated LLM usage through a user study involving two LLM-based prototypes (N=12). While each study addressed distinct research goals, they revealed a broader narrative about evolving LLM usage in aggregate. We discovered an emerging shift in data understanding from heuristic-first, bottom-up approaches to insights-first, top-down workflows supported by LLMs. Furthermore, to respond to a more complex data landscape, data practitioners now supplement traditional subject-expert-created 'golden datasets' with LLM-generated 'silver' datasets and rigorously validated 'super golden' datasets curated by diverse experts. This research sheds light on the transformative role of LLMs in large-scale analysis of unstructured data and highlights opportunities for further tool development.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2412.16089

Country:

North America > United States (0.70)
Europe (0.67)

Genre:

Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (0.88)
Research Report > New Finding (0.87)

Industry: Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The data practitioner for the AI era

MIT Technology ReviewJun-10-2024, 14:00:00 GMT

Data practitioners are among those whose roles are experiencing the most significant change, as organizations expand their responsibilities. Rather than working in a siloed data team, data engineers are now developing platforms and tools whose design improves data visibility and transparency for employees across the organization, including analytics engineers, data scientists, data analysts, machine learning engineers, and business stakeholders. This report explores, through a series of interviews with expert data practitioners, key shifts in data engineering, the evolving skill set required of data practitioners, options for data infrastructure and tooling to support AI, and data challenges and opportunities emerging in parallel with generative AI. The report's key findings include the following: This content was produced by Insights, the custom content arm of MIT Technology Review. It was not written by MIT Technology Review's editorial staff.

artificial intelligence, deep learning, machine learning, (4 more...)

MIT Technology Review

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.31)

Add feedback

Qrlew: Rewriting SQL into Differentially Private SQL

Grislain, Nicolas, Roussel, Paul, Agathe, Victoria de Sainte

arXiv.org Artificial IntelligenceJan-11-2024

This paper introduces Qrlew, an open source library that can parse SQL queries into Relations -- an intermediate representation -- that keeps track of rich data types, value ranges, and row ownership; so that they can easily be rewritten into differentially-private equivalent and turned back into SQL queries for execution in a variety of standard data stores. With Qrlew, a data practitioner can express their data queries in standard SQL; the data owner can run the rewritten query without any technical integration and with strong privacy guarantees on the output; and the query rewriting can be operated by a privacy-expert who must be trusted by the owner, but may belong to a separate organization.

mechanism, query, relation, (14 more...)

arXiv.org Artificial Intelligence

2401.06273

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Breaking The AI Bias: How To Define Fairness To Deliver Fairer Models

#artificialintelligenceNov-9-2022, 11:16:10 GMT

At a basic level, AI learns from our history. Unfortunately, much of societal history includes some discrimination and inequality. It's therefore essential that data practitioners consider this in their work as AI built without acknowledgement of bias will replicate and even exacerbate this discrimination. This is particularly concerning when you consider the influence AI is already exerting over our lives. McKinsey's recent digital trust survey found that less than a quarter of executives are actively mitigating against risks posed by AI models (this includes fairness and bias).

deliver fairer model, fairness, practitioner, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

Synthetic Data and the Data-centric Machine Learning Life Cycle

#artificialintelligenceNov-9-2022, 07:46:12 GMT

In this series of posts, we'll cover how Gretel's synthetic data platform helps you overcome challenges across the data-centric machine learning life cycle to help you successfully build, deploy, maintain, and realize value from your AI projects. The life cycle outlined below is a common framework or workflow process for building machine learning and AI solutions. It's focused on streamlining the stages necessary to develop machine learning models, deploy them to production, and maintain and monitor them. These steps are a collaborative process, often involving data scientists and DevOps engineers. The process below was inspired by the value chains created by The Sequence, Databricks, Google Cloud, and Microsoft.

data practitioner, life cycle, use gretel synthetic, (6 more...)

#artificialintelligence

Industry:

Information Technology (0.36)
Health & Medicine > Therapeutic Area (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Getting a Peak of the Big Data/Cloud Computing Workflow Using AWS

#artificialintelligenceOct-28-2022, 08:20:09 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Although I've had the chance now to play with these different technologies, I'm still amazed by the convenience, portability, and computing power that Big Data and Cloud Computing technologies offer, both to consumers and businesses.

big data cloud computing workflow, jupyterhub, summary tab, (12 more...)

#artificialintelligence

Genre: Workflow (0.71)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Top 15 Books to Master Data Strategy - KDnuggets

#artificialintelligenceJun-16-2022, 13:07:16 GMT

If you're a data practitioner with your eye on a leadership role, learning Data Management will be an important step toward getting you where you want to go. In this article, we outline 15 books on topics ranging from Data Architecture (highly technical) to Data Literacy (broadly nontechnical) to help you improve your understanding of end-to-end best practices related to data. Summary: I'd be remiss if I didn't begin this list here. This behemoth covers 14 practical topics related to Data Strategy, followed by 3 topics related to implementation. The 14 different knowledge areas are best represented by the Aiken Pyramid, which outlines how these topics build upon each other.

data management, data quality, data strategy, (13 more...)

#artificialintelligence

Industry: Information Technology (0.69)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Data Science > Data Mining (0.69)

Add feedback

A better way to browse the web for data practitioners

#artificialintelligenceDec-17-2021, 16:00:33 GMT

We are trying to imagine the smoothest way to solve this problem and capture the web efficiently, but we need your support and your feedback to make it happen. This is the part where the information comes to you, because we're not always looking for stuff. On your home page, you would be able to access general reading (and watching, and listening) recommendations, based on your interests and latest readings. Tell us what you're working on or where you are stuck and our AI search agent will find the right content. We are the first search engine dedicated entirely to AI content: you may not know a concept, we will suggest and define it for you.

browse, data practitioner, recommendation

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (0.43)

Add feedback

6 productivity tips for beginner data scientists

#artificialintelligenceNov-8-2021, 16:13:28 GMT

Tips that will fast track productivity in your data science journey as a beginner. I could remember, When I wanted to learn data science, machine learning, I was also curious about specific things I need to do to fast-track myself while I just started since having passed that stage and have more experience. I will be sharing some tips that will help beginners in their journey from my experience In data science. In this article, You will understand ways to improve yourself as an aspiring or beginner data scientist. I will explain six important productivity tips to improve yourself as a beginner, junior, undergraduate, or aspiring data scientist.

beginner data scientist, data science, data scientist, (11 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Getting the most from your data-driven transformation: 10 key principles

MIT Technology ReviewOct-14-2021, 16:08:28 GMT

The importance of data to today's businesses can't be overstated. Studies show data-driven companies are 58% more likely to beat revenue goals than non-data-driven companies and 162% more likely to significantly outperform laggards. Data analytics are helping nearly half of all companies make better decisions about everything, from the products they deliver to the markets they target. Data is becoming critical in every industry, whether it's helping farms increase the value of the crops they produce or fundamentally changing the game of basketball. Used optimally, data is nothing less than a critically important asset. Problem is, it's not always easy to put data to work. The Seagate Rethink Data report, with research and analysis by IDC, found that only 32% of the data available to enterprises is ever used and the remaining 68% goes unleveraged.

data-driven transformation, information, journey, (14 more...)

MIT Technology Review

Industry: Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback